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Abstract 

We develop a coagulation-fragmentation model to study a system com- 
posed of a small number of stochastic objects moving in a confined domain, 
that can aggregate upon binding to form local clusters of arbitrary sizes. 
A cluster can also dissociate into two subclusters with a uniform probabil- 
ity. To study the statistics of clusters, we combine a Markov chain analysis 
with a partition number approach. Interestingly, we obtain explicit for- 
mulas for the size and the number of clusters in terms of hypergeometric 
functions. Finally, we apply our analysis to study the statistical physics 
of telomeres (ends of chromosomes) clustering in the yeast nucleus and 
show that the diffusion-coagulation-fragmentation process can predict the 
organization of telomeres. 

1 Introduction 

Coagulation-fragmentation modelings have been applied to various complex sys- 
tems evolving at various scales ranging from star formations to polymer orga- 
nization. Although coagulation of a large number of particles is described by 
continuum variable, for a system composed of a finite number of stochastic par- 
ticles, Marcus-Lushnikov process [T] can be used. This approach is based on 
Markov processes and combinatorial stochastic processes [2J [3J 21 [5] . 
In this letter, we study the dynamics of finite number of particles undergoing 
coagulation-fragmentation in a confined domain. Two stochastic particles bind 
with a Poissonian rate kf while a cluster can coalesce with any other cluster or 
particle at the same rate kf, or dissociate at a rate (n — l)kb, where n is the 
number of particles in the cluster and kb is the backward rate constant. The 
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dissociation gives rise to two clusters of size p and n — p, where the law of dis- 
sociation uniform. We develop a Markov analysis to compute the probability of 
a distribution of K clusters and obtain the steady state distribution, the mean, 
the variance, the size of clusters and the number of particles per cluster. Finally, 
we apply the present model to the organization of the 32 telomeres in the yeast 
nucleus. 



2 Analysis of cluster dynamics 

We consider N stochastic particles located in a confined domain which can 
interact following the rules described above (Fig. QJV). Our goal is to compute 
the probability density function 

P K (t) = Pr{ to have K clusters } (1) 

that the particles arc distributed in K clusters at time t. It satisfies a Markov 
chain that we shall now derive: the probability of having K clusters at time 
t + At is the sum of the probability of starting at time t with K — 1 clusters and 
one of them dissociates into two smaller ones plus the probability of starting 
with K + 1 clusters and two of them associate plus the probability of starting 
with K and nothing changes (Fig. Q}3). The first probability is the product of 
Pk—i($) by the transition rate f^-iAt to go from the state where there are 
K — 1 clusters to K, while the second term describes the transition from K + 1 
clusters to K. It is the product of Pk+i by the transition rate Cff+iAi. For a 
set of K indistinguishable clusters, the number of pairs is equal to K % — — and 
the association rate is 

ok = '-k f , (2) 

where kf is the encounter rate of two particles if there is no other particle. In 
contrast, for a cluster of size n, the dissociation rate is (n — l)fcf,. When there 
are K clusters of size n.j, the ensemble of configuration is (ni, . . . , %), with the 
conservation of the number of particles n i ~ N. We further considered 

that clusters are ordered by size ri\ > • • • > % > 1. The total transition rate 
from K to K + 1 clusters is the sum over all possible dissociation rates 

K 

Thus the Master equation for P K (t) is [7J B H] 

f Pi(t) = -fiPi(t) + c 2 P 2 (t) 

P K (t) = -(c K + f K )P K (t)+c K+1 P K+1 (t) + f K -iPK-i(t) (4) 

I P N (t) = -c N P N (t) + f N -iP N -i(t). 
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Figure 1: (A) Coagulation- fragmentation of a finite number of particles. Two 
particles aggregate at a rate kf and a cluster of n splits at a rate (n — l)kj,. (B) 
The Markov chain representation of the cluster dynamics where fx (resp. ck ) 
is the dissociation (resp. association) rate of a cluster (resp. of two clusters) 
when there are K clusters. 



To determine the number of clusters at steady state, we integrate explicitly 
the Markov chain ((4]) and the steady state probability 11^ = lim t _ ! . 0O Pj((t) is 
given by 

_(2a) K (N-l)\ 
Uk+1 - K\ {K + l)\{N-K-l)P U (5) 

where the equilibrium parameter is 

Using the normalization condition Y]jf ILk = 1, the probability II i can be 
expressed using hypergeometric series: 

Ul = 1 F 1 (-N+l;2;-2aY (?) 

where \Fi is the Kummer's confluent hypergeometric function (Eq. 13. 1. 2, [TP]). 
At steady state, the average number of clusters is 

N 

< M 0o (a,iV) >= Kn K = l + a(N-l)Gi(a,N) (8) 

K=l 

where 

1 F 1 (-jV + 2;3;-2 a ) 
Gl{a ' N) - 1 F 1 (-N + l;2;-2aY (9) 
In addition, the nth-order moment jj, n is given by 

^ H n ( 1 F 1 )(-N + l;2;z), z __ 2a , x 
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where H is the operator defined by H(f)(z) = -j^zf(z). Since the derivative of 
Rummer's function is 

d 1 F 1 (a; /3; z) a 

3 = ~siFi{a+ + l;z), 11 

dz p 

the moments can be written as 

(N-iy. (2a) fc 
= 2^(fc + l)!(iV-l-fc)! Jfel Gfe(a,iV) ' 



fc=0 



where 



n ( , n iFi(-JV + l + n;2 + n;-2a) 

G„ a, iV = ; ; 

V ; iFi(-2V + l;2;-2a) 



(12) 



and the coefficients oft are given by 



a t = < 



f ^;a(-iF (t+1 ^y +1) " i»i 8 eve„, 



(13) 



and «q = a " = Thus the variance of the number of clusters is given by 
<V 00 (a,N)> = /Li 2 -Mi 

= a(N - l)G\{a, N) + \a 2 {N - l)(N - 2)G 2 (a, N) - a 2 {N - l) 2 G\{a, 

o 

In addition, for large N and fixed a, using asymptotic results for hypcrgcometric 
functions [TO], we have 

(n 4- 1 V 

G n (a,N) a / — s (15) 
and we obtain from equation ((HJ that the mean number of clusters is 



<M 00 (a,N)>=V2^N + 0(l). (16) 

As a result of the above analysis, we can estimate the mean number of clusters 
< M oc (a, N) >, which is plotted as a function of a (Fig. [5JV) and N (Fig. 03). 

3 Size of the clusters 

Because the previous analysis cannot predict the size of the clusters, we shall now 
evaluate the probability density function for the number of clusters of different 
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Figure 2: Number of clusters < M oc (a, N) > as a function of the equilibrium 
parameter a (A), for N — 10 and N = 30 obtained from eq. (jHJ) and (B) as 
a function of number N for a = 5, using eq. ([8]) (continuous line) and the 
asymptotic approximation eq. (|16j) (dashed line). 



sizes. This analysis relies on carefully studying the configurations of N particles 
decomposed in K clusters. The configuration is described by the ensemble of 
ordered clusters of size m, . . . , tlk 

k 

Anj< = {( n ])i<j<i<;Y Ui = N > ni - ' ' ' - UK - 1 J'' 
»=i 

which is equivalent to the set 

N N 

A> NJ< = {(mi)l<i<N; ^ im% = N > X! m% = 
i=l i=l 

where m, is the number of clusters of size i. The correspondence from An.k to 
A' N K is obtained by taking into account that for all clusters rij of size i, we have 
the relation m, = J2j ^( n j = *) 12 • We enumerate the number of configurations 
associated with the ensemble A' N K . It is the same as counting the occurrence 
of the integer i in the decomposition of N as a sum of K integers. The number 
of occurrence of the configuration (mi, . . . ,mjv); when there are mi clusters of 
size 1, mo clusters of size 2 is the multinomial coefficient — — r . Thus 
the conditional probability of the (mi, ...,mpf) distribution, when the total 
number of clusters is equal to K, is obtained by normalizing the equilibrium 
over all possibilities 

K\ 

t w\ m 1 l...m N l , „x 

p(mi,...,m N \K) = . (17) 

T,( mi )eA' NiK mi i... mjv ! 
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Interestingly by summing the series (X + X 2 + ... + X N ) K for the N th order 
coefficient and using the N — K derivative of r^xW > we have 

V Kl = {N ~ 1)l (18) 

^ mi \...m N \ {K- 1)\(N - K)V y ' 



(mi)eA' r 



N,K 



At this stage, we shall explain the rational for expression (|17[) . Indeed, it corre- 
sponds to the equilibrium distribution of particles in clusters, where the dissoci- 
ation (resp. association) rate is proportional to the number of elements (minus 
one) (resp. the number of pairs of particles). 

This consideration defines the steady state. Indeed, if we consider the 
equilibrium probability distributions associated with the Markov chain config- 
uration (mi, . . . , m^v), then the transition between the two neighboring states 
(mi, . . . , mi — 1, . . . , raj — 1, . . . , nii + j + 1, . . . , m^v) and (mi, . . . , mjv), is ob- 
tained first from the coagulation of a cluster of size i with one of size j, with 
a rate C(i,j) = ^-mirrij if i ^ j and C(i,i) = ^-mi(m,i — 1) otherwise. The 
factor | accounts for the two cases C(i,j) and C(j,i). The fragmentation rate 
from i + j to (z, j) is F(i,j) — kb(m i+ j + 1) (Fig. [3]). Thus, the stationary 
probability n necessarily satisfies the relation 

7r(mi, . . . ,m, - 1, . . . ,m 3 ■ — 1, . . . , m l+j + 1, . . . , tun) _ C(i,j) _ 1 m^mj 



7r(mi, . . . , mjv) F(i,j) 2am,i + j + l 

When the configuration (mi, . . . , mjy) is made of K clusters, the configuration 
(mi, . . . , mi — 1, . . . , raj — 1, . . . , rrii + j + 1, . . . , m^) has K — 1 clusters. From 
Eq. (5), we have that 

n JC _ 1 (o) = n A: (o); 



(19) 



2{N - K + l)a 
and the conditioned stationary probability satisfies 

7r(mi, . . . , rrii — 1, . . . , mj — 1, . . . , m^j + 1, . . . , m^)\K — 1) Hk(cl) 1 mim 3 



n(mi, . . . ,m N \K) IlR-_i(a) 2a m l+j 

N — K + 1 m,m 



{20) 



K(K-l) m i+j 

A direct computation shows that the probability p{m\, . . . ,mjv) satisfies equa- 
tion (|2"Tj) |11) . Now using Bayes rule, the joint probability of the configura- 
tion (mi, . . . , m^r) and K clusters is the product of the conditional probability 
p(mi, . . . , mjvl-ftr) by the probability of having K clusters 

p{mi 1 ...,m Nl K) =p(m 1 , . . . ,m N \K)IL K . (22) 

We shall now estimate the size of clusters from the above considerations. For a 
total of N particles, when there are K clusters, the mean number of clusters of 
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(m 1 ,...,m i -1,...,m r 1,...,m i+j +1,...,m N ) 

A 

Coagulation (kfrnjirij) 

(m 1I ...,m N ) 

Dissociation (kb m i) 
(m 1 ,...,m i . k +1,...,m k +1,...,m i -1,...,m N ) 

Figure 3: Markov chain representation of the configuration dynamics. When 
the configuration is (mi, . . . , mjv), the dissociation rate of a cluster of size i into 
a cluster of size k and one of size i — k is kbmi , while the rate of formation of a 
cluster of size i + j from two clusters of size i and j is equal to -d-miirij. 



size n is 



< M n > K = ^2 m n p(mi\K) 

{N - n - 1)\K\{N - K)\ 



(N - 1)\(K - 2)\(N -n-K+l)V 



which we computed using equation (|18[) and that when a cluster of size n is con- 
tained in the distribution (mi, . . . , mjv), it is equivalent to have N — n particles 
in K — 1 clusters. Finally the mean number of clusters of size n is obtained by 
summing over all possibilities when there are K clusters, 



A' 



<M n > = ]T < Mn >K U K (23) 



K=l 



(N-n-iy. -^-y K(K - l)(N - K)\ 
(N-l)\ ^ (N-n-K + l)\ K ' 

Using the expression for XIk ©, we obtain 

<M n > = 2a K , ' ' — —!- if n < N, 

iF x (-iV + l;2;-2a) 

< M N > = 



(24) 



iFi(-iV+l;2;-2a) 



In addition, the mean number of clusters of size n is 2a rr n , 1 .V y ^ \ , i-e. this is 

' 111 ( A— n) ' 

the probability ratio of having one cluster when there are N particles over the 
probability of having one cluster when there are N — n particles. Finally, the 
variance of the number of clusters of size n is, if TV > 2n + 1, 

<V n > = <M n >-< M n > 2 +2a- 



(N -2n+l)(N -2?7 + 2) V n i(^-2") ni(A^-2r7~ 



if) 
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and otherwise 



<V n > 



< M n > - < M n > 2 . 



(26) 



To summarize this analysis, we plot in Fig. U the mean number of clusters of 
size n for five particles. We will also use this analysis in the final section to 
study telomere clustering in yeast. 
At equilibrium, particles are exchanged between clusters. To characterize 



Figure 4: Mean number < M n > of clusters of size n as a function of the 
parameter a (A) for N = 5 particles (equation (f2"4"|)) and (B) mean number of 
cluster fora = 0.1,a = 0.4 and a = 2 as a function of the cluster size. 

the exchange of telomeres between clusters, we now estimate the probability 
P2 {N, a) to have two given particles in the same cluster C for a fixed num- 
ber of particles N and equilibrium constant a. For a distribution of clusters 
(ni, . . . ,tik), the probability for two specific particles to be in C is obtained 
by choosing the first particle in the cluster rij, which is equal to the number of 
particles in the cluster divided by the total number of particles j£ and thus the 
probability to have the second particle in the same cluster is n ^Z\ ■ Summing 
over all clusters, we get 




A 



5r B 



a 



°1 2 3 4 

Cluster size 




(27) 




Using that ^2 




N 
i=l 



i 2 rrii, we have 




(28) 



Summing now over all distributions of clusters, the probability 

MN,a) = -_L_ + N±±,F l{ N + l;3 .^ 2a) 

F y ' ; N — 1 N — 1 iFi(—N + 1; 2; —2a) V 7 

Surprisingly, the three-term recurrence relation for Rummer's function f [10j 
Eqs. 13.4.1-13.4.6) gives iF x (-iV + l;3;-2a) = f=± x Fi(-iV + 2; 3; -2a) + 
7 ^ I1 F 1 (-N+l;2;-2a). Finally 

p 2 (JV,a) = Gi(a,iV). (30) 




Figure 5: (A) Number of clusters containing more than three particles 0(N, a) 
as a function of the equilibrium parameter a, for N = 32. Experimentally, 2.9 
clusters were observed in average, which corresponds to a = 0.2 and a = 15. 
(B, C) Distribution of the mean number of clusters as a function of the cluster 
size (Eq. ([21])). for TV = 32 and a = 0.2 (B) and a = 15 (C). Grey bars indicate 
single particles and clusters of two particles, which are not observable. 



4 Application to telomere clustering in yeast 

To apply the previous analysis to the organization of telomere clustering in 
yeast, we use a coarse-grained model of a telomere, the motion of which can be 
approximated as Brownian [T2]. In addition, because clusters are of small sizes, 
the arrival time of a telomere to cluster is Poissonnian. Indeed, it is the time for 
a stochastic particle moving on the nuclear surface to reach a small target. Our 
goal is now to show that telomere clustering can be described by our diffusion- 
fragmentation- association model and thus by the Master equation (£[]). For that 
purpose, we propose to estimate the forward and backward rates, constrained 
by recent observations that in yeast the 32 telomeres form 2 to 8 clusters, with 
an average of 2.9 clusters per cell, containing in average 4 telomeres [Tl] . 

For a telomere radius r = 0.015 /im, moving on a sphere (nucleus surface) 
of radius R = 1 /im with a diffusion coefficient D = 0.005 /im 2 .s _1 [15], we find 
using a Brownian simulation for two telomeres to meet, that the forward rate is 
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kf = 1.910~ 3 s _1 . Although the cluster can vary in size when telomeres attach, 
we shall make the assumption that the encounter will stay constant and use this 
rate in our previous Markov modeling. Indeed, telomeres are attached to the 
nuclear surface by a family of proteins Sir2/Sir3/Sir4 [T3J [T3] - Telomere clusters 
are elicited by the formation of Sir3-Sir3 interactions, and Sir3 abundance is 
directly related to the stability of the clusters (dissociation rate kb). Thus, in 
the absence of any experimental evidences, we consider that telomere diffusion 
is driven by a molecular complex, moving on the nuclear surface. When this 
complex has the shape of a cylinder, the diffusion constant is given by the log of 
the radius [IB]. Since clusters contain at most four telomeres [T3], the effective 
radius does not change much leading to a small change of the diffusion constant. 
This justifies our constant approximation for the encounter rate kf(m, n) of two 
clusters of size n and m. 

The dissociation rate fc& of a telomere from a cluster cannot be easily derived 
from experimental data because clusters made of one or two telomeres are not 
experimentally visible [TJ]. Using the results of the first sections (equations 
([5]).(|24p). we use our model Markov model to compute and to plot (Fig. [SK) 
the number of visible clusters obtained by the formula 

0(N,a) = <M oc (a,N)>-<M 1 (a,N)>-<M 2 (a,N)> 
= 1 - 4a + (a(N -l)+Qa 2 )G 1 (a, N) ~^a 3 G 2 (a, N) 

as a function of the equilibrium parameter a. We find from this graph that there 
are two possible values for a, giving an average of 2.9 visible clusters, which are 
a = 0.2 and a = 15. To select the correct value, we built the cluster distribu- 
tions associated for these two values from equations (|24|) . Wc further obtained 
the mean number of clusters of a given size (Fig. 03, C). However, for a = 0.2, 
telomeres form a giant cluster, containing almost all the telomeres, which is 
not reported experimentally. Finally, we conclude that a = 15, for which there 
are always less than five telomeres per cluster, in agreement with experimental 
observations [TJ]. Furthermore, wc obtain for the dissociation rate, the value 
kb = akf = 0.03 s _1 , which was not known before. Finally, using formula (|30[) , 
we can predict that two specific telomeres are located in the same cluster ap- 
proximately 4% of the time. 

In conclusion, wc obtained here a novel analysis for a coagulation-fragmentation 
process of a finite number of Poissonnian particles restricted in a bounded do- 
main. We used our model to estimate the dissociation rate kb of telomeres from 
a cluster, a constant that can reveal the local organization of telomeres in a 
cluster. Although we applied this analysis to the organization of telomeres in 
yeast, it could be applied to study telomere organization for general organisms. 
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